Correlations and Scaling Laws in Human Mobility 
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Human mobility patterns deeply affect the dynamics of many social systems. In this paper, we 
empirically analyze the real-world human movements based GPS records, and observe rich scal- 
ing properties in the temporal-spatial patterns as well as an abnormal transition in the speed- 
displacement patterns. We notice that the displacements at the population level show significant 
positive correlation, indicating a cascade-like nature in human movements. Furthermore, our anal- 
ysis at the individual level finds that the displacement distributions of users with strong correlation 
of displacements are closer to power laws, implying a relationship between the positive correlation of 
the series of displacements and the form of an individual's displacement distribution. These findings 
from our empirical analysis show a factor directly relevant to the origin of the scaling properties in 
human mobility. 

PACS numbers: 89.75.Fb, 05.40.Fb, 89. 75. Da 



I. INTRODUCTION 

The statistical patterns of human daily movements di- 
rectly affect the physical contacts between humans and 
thus deeply impact the dynamics of many social systems. 
The understanding of real-world human mobility pat- 
terns would be much helpful for many aspects in social 
dynamics, such as epidemics spreading the design- 

ing of traffic systems Q, or localized recommendations 
^, Q- Since the pioneering work of Brockmann et al 
^, the temporal-spatial statistical properties in human 
movements have become a new issue in complex sciences 
and have attracted much attention in recent years. 

The most dramatic discovery in the statistical patterns 
of human mobilit y is the existence of wide-spread scal- 
ing properties |8l4l0|. The first one is the power-law- 
like displacement distribution, which has been observed 
in many empirical analyses of real- world human move- 
ments [a |Tl| ageographicnd even in the virtual world 
of online-games [I4I . This result sharply differs with the 
traditional understanding based on random walks, and 
reveals long-range correlations in human travels and so- 
cial interactions. Other scaling properties include the 
staying time distributions which denote that humans usu- 
ally stay in a few locations quite a long time 9] , and the 
visitation frequency distributions are dominated by a few 
locations that are usually much more frequently visited 
0) , and so on. 

Many other abnormal properties are also found in hu- 
man mobility patterns, including ultra-slow diffusionjl, 

, anisotropism 0], high predictability fl3!| , and the lim- 
itation of roads [IJ]. These discoveries reveal abnormal 
features in real-world human mobility, in stark contrast 
to the traditional understandings based on the hypothesis 



of random-walk-like human mobility or on that of Levy 
ffights with the same scaling displacement distributions. 

However, these findings are still facing several contro- 
versies. Due to the limitation of original data, most pre- 
vious works are at the population level, and a direct anal- 
ysis of individuals is rarely seen. Recently, Yan, et al. 
[l5| reported the diversity in individual-level mobilities 
and found out that most of the individuals' displacement 
distributions do not obey the scaling law. Moreover, sev- 
eral recent researches indicated that the move length in 
human urban trips or the travels by a single type of trans- 
portation do not obey well a power law [l6l - [l8| . These 
controversies require the confirmation from a more in- 
depth empirical analysis of human mobility patterns. 

Recent studies also proposed many models to explain 
the underlying mechanisms that drives the emergence of 
these anomalies in human mobility. Generally, the basic 
dynamics of previous modeling works can be divided into 
the following classes: i) The descriptive models: Levy 
flights [HI, Self-similar least action walk (SLAW) [l9|. 
and Continuous-time random walks [sj; ii) The explo- 
ration of new locations and the preferential return to 
visited places [ld|; iii) The effect of hierarchical traffic 
systems [1^ ; iv) the effect of few dominant trips [2l| ; v) 
The spatial heterogeneity of population density or the 
geographic locations [l^, [l^]; vi) The radiation model 
proposed by Simini et. al. [1^, which can reproduce 
many mobility patterns at the global level; vii) The ag- 
gregation of individuals without scaling properties |15{. 
These models can reproduce parts of the empirical find- 
ings. Nevertheless, it is difficult to identify common rules 
from these model, and thus it remains controversial what 
drives the emergence of these abnormal properties in hu- 
man mobility. It would therefore be helpful if the empir- 
ical analysis can identify characteristic factors affecting 



2 




P10 - P15 



P16 



FIG. 1: (Color online) Identification of two different staying 
points. P1-P17 represent 17 track points recorded by a GPS 
equipment from whidi we obtain two staying points SI and 
S2. The displacement of travel is defined as the distance be- 
tween the centers of the two staying points. 



the emergence of these anomalies. 

In this paper, based on the empirical analysis of GPS 
data sets, we report one of the characteristic factors that 
is relevant to the scaling displacement distributions: the 
correlation of the series of displacement. We first show 
the aggregated temporal-spatial properties at the popula- 
tion level (Section II) and then we analyze the correlation 
of the aggregated series of displacements (Section III). 
Finally, we discuss the diversity in individuals' mobility 
patterns and the relationship between the correlation of 
the series of displacements and the scaling properties of 
displacement distributions (Section IV). We show that 
the correlation is indeed a tool that allows to investigate 
the underlying mechanisms from the empirical data. 



II. THE SCALING PROPERTIES AT THE 
POPULATION LEVEL 



The data set in our analysis contains records from 
165 volunteers that have been gathered over three years 
(April, 2007 - Sep., 2010). The GPS trajectories result 
from the Microsoft Research Asia in Geo-life Project [23 - 
[2^. More details can be found in Appendix A. 

We determine the effective staying positions from the 
dataset using a resolution of 10 meters in space and 120 
seconds in time. Fig. [T] illustrates for a case of two stay- 
ing positions, SI and S2, are obtained from a sequence 
of GPS records. Details of our approach can be found 
in Appendix A. The geographical distance between two 
consecutive staying positions, e.g. SI and S2 in Fig. (TJ is 
defined as the displacement of travel. The staying time 
in each staying position is defined as the time interval be- 
tween the first and last GPS records in the given staying 
position. 

Using the above method, we obtain 927 trajectories 
with recording times longer than 6 hours that contain 



19376 effective staying points. The total staying time is 
4463 hours, and the total displacement is 95472.33 kilo- 
meters. From each of these trajectories, we can obtain 
a sequence that contains the staying positions, displace- 
ments and staying times. 

We combine the displacements and staying times in all 
927 files to calculate the displacement distribution and 
the staying time distribution at the population level. Af- 
ter log-binning, the displacement distribution P{d) gen- 
erally obeys the following power-law function with two 
different regimes (Figl2](a)): 



P{d) 



{d < 6.5km), 
{d > 6.5km). 



(1) 



This power-law displacement distribution indicates that 
the typical behavior consists of many short-range trips 
and few long-range travels. This conclusion is in sub- 
stantial agreement with the conclusions of several pre- 
vious findings [lo| . The transition at d ~ 6.5km is 
related to the average extend of the urban district of 
cities, indicating the difference between human urban 
movements and intercity travels. This difference may 
be due to the convenience of urban movements and the 
dominant high-frequency movements between few posi- 
tions (such as home and working places) [H, [2l| . 

A similar scaling property is also observed in the stay- 
ing time distribution P{ts) at the population level, which 
can be well fitted by a power-law function with exponent 
— 1.98 (Fig. [2](b)), indicating that humans usually stay in 
few positions a quite long time. This result is also close to 
previous findings based on other data sets [1, [Tl|, [l6|, ^Ti\ . 

The distribution P{te) of the elapsed time te that indi- 
viduals spend on the way from an effective staying posi- 
tion to the next one has also been studied. As shown 
in Fig. [2] (c), P{te) shows a strange behavior where 
two power-law-like sections are separated by an unusual 
bump when 10"* < <e < 2 x 10^ seconds. It seems that 
this bump results from traffic jams. This result is some- 
what different to the previous findings in urban taxi data 

Moreover, we calculate the average speed Vi — di/tei 
for every user i, and plot each pair {vi,di) on the plane 
to get the pattern of the relationship between speed and 
displacement. We surprisingly find that v ys. d generally 
obeys the bilinear form in a log-log plot, in which the 
first section {d < 10'^ meters) is linear, whereas another 
part {d > lO'^m) is sublinear (slope « 0.5), as shown in 
Fig. [3] The point of transition d w 10"^ meters and w « 1 
m/s, could relate to the length and speed of walking, 
therefore the two sections would correspond to the travel 
by foot or by automobile, with humans preferring a trip 
by automobile (bus, car, etc.) for distances longer than 
1 kilometer. 

In addition, some movements have ultraslow speed, as 
indicated by the dashed line in Fig. [3] The corresponding 
displacements of these ultrashow movements are gener- 
ally between 10^ meters and 10^ meters, and the corre- 
sponding elapsed time is mainly in the range from 10^ 
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FIG. 3: (Color online) The relationship between the average 
speed V and the displacement d, where the slopes of the upper 
lines are 1.0 and 0.5 respectively, whereas the slope of the 
lower dashed line is 1.0. 



seconds to 2 x IC* seconds, corresponding to the bump 
in P{te) displayed in Figl^l possibly indicating displace- 
ments hampered by traffic jams. 



III. CORRELATIONS OF DISPLACEMENTS AT 
THE POPULATION LEVEL 

For eacfi sequence of displacements of individuals, the 
correlation between two consecutive displacements re- 
flects the trends and causal relationsfiip in human travels. 
To get the pattern of the correlation, we plot each of the 
data points (di,di+i) and calculate the density of these 
data points. Here di and di+i denote the i-th and the 
{i + l)-th displacement in the series d. As shown in Fig. 
mja), most of the data points [di,di+i) accumulate close 
to the diagonal line d^+i = di, corresponding to a positive 



correlation. 

We also plot the pattern using the related displacement 
d = d/dr, where c?* is the average displacement of the 
user. We first calculate the average displacement of each 
user and then obtain the sequences of d from each file. 
Fig. Hl^b) shows the pattern of the density of the data 
points {di,di+i), where the positive correlation is much 
clearer. 

Taking into account the heterogeneous d, we use 
the rank-based correlation coefficient named Kendall's 
Tau to quantify the strength of this correlation. We 
first set di — {di,d2,-'' ^di,--- ,djv-i}j and di^i ~ 
{d2,d3,--- ,di+i,--- ,di\[} for every series, where N is 
the total number of the displacements in the correspond- 
ing file. The detailed introduction of Kendall's Tau can 
be found in Appendix B. The value of the Kendall's Tau 
TK — 0.424 for the series di and di+i, and the confidence 
interval with 95% significance level is 0.015. For the re- 
lated displacement series di and d^+i, tj^ = 0.435 which 
represents a significant positive correlation. 

This remarkable positive correlation shows that a trip 
can have effect on the next one: if the current displace- 
ment is long, the next one has a high probability to be 
only slightly different. The change in displacement is 
usually gradual. This gradual change agrees with our 
daily experience. For example, if we travel to another 
city, we first need to find a hotel in the city. The move- 
ment from our city to the target hotel generally is a long 
travel (the length may be several hundred miles). In the 
next several days, we might leave the hotel to visit some 
places around the city (generally tens of miles). During 
each trip, our visit will contain many short moves (usu- 
ally less than one mile). A direct trip from our city to 
the place in the target city rarely appears. 

Furthermore, to investigate the long-term correlations 
in human mobility, we calculate the Kendall's Tau tk 
of the series di and di^m (w = 1, 2, • • • ), and find that 
the function vs. m shows a remarkable slow decay, 
which can be well fitted by a power-law function with an 
exponent —0.276 (Fig. EKa)), implying that the effect of 
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FIG. 4: (Color online) Movement correlation scatter plot at the population level. Taking a displacement as abscissa, and the 
next displacement as ordinate, we get a scatter of {di,di+i) in a log-log plot. The figure shows a high density of points near 
the diagonal line di+i = di, which means a strong positive correlation between consecutive displacements. 
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FIG. 5: (Color online) (a) The decay of Kendall's Tau tk be- 
tween di and di+m as a function of the interval m. (b) Pearson 

correlation coefficient Rm between log{di) and log{di+rn) as a 
function of the interval m. 



previous movements can continue a very long time. To 
ensure it, we plot the Pearson correlation coefficient Rm 
between log{di) and log{di+rn) ■ It does obey a power-law 
decay with a slope 7 = —0.367 (Fig. Eljb)). The value 
H = \ + 7/2 ~ 0.82 is the well-known Hurst exponent 
that denotes the long-term correlations in the fluctuation 
of the series [2^ . Using the method of detrended fluctua- 
tion analysis (DFA) [231 (^^e Appendix C), we also obtain 
a similar the Hurst exponent value H = 0.87, indicating 
a strong long-term correlation among the displacement 
series. 

We also studied two other correlations, namely the cor- 
relation among the series of staying times, and the cor- 
relation between the staying time and the displacement. 
The series of staying times show only a weak positive 
correlation (its tk — 0.120 with the confidence interval 
0.015), and the staying times and displacements are al- 
most independent (its tk = 5.30 x 10"'^). 



IV. MOBILITY PATTERNS AT THE 
INDIVIDUAL LEVEL 

The above discussions showed the scaling patterns and 
positive correlation of human movements at the popu- 
lation level. Nevertheless, since the above results are 
aggregated over all individuals, we can not directly con- 
clude that the movements of each individual also exhibit 
the same properties. Actually, power-law-like displace- 
ment distribution at the population level can even be ob- 
served in a system where all the individuals' movements 
are Poissonian [l^, |33|. Because of the lack of direct 
evidence, it remains controversial whether the scaling 
mobility patterns are universal at the individual level. 
Recently, Yan et al. reported the diversity of human 
mobility patterns at the individual level and that many 
individuals' displacement distributions usually are domi- 
nated by some frequently-appearing mobilities [T5l|. Due 
to the limitations in the original data sets of Yan's work, 
this conclusion still needs to be confirmed by more in- 
depth empirical studies based on datasets with higher 
resolution. 

Among the 100 remaining users, we choose the users 
who had more than 200 effective staying positions to 
study their mobility patterns at the individual level, 200 
being almost the lowest bound to obtained efficient statis- 
tical patterns. By doing this, 32 effective individuals with 
698 files and 15189 staying positions are chosen. The 
number of effective staying positions Us and the number 
of displacements Ud of each of the 32 users are listed in 
Table 1. 

Plotting the displacement distribution P{d) and cor- 
relation patterns {di,di+i) for each of the 32 users, we 
remark that users with stronger positive correlation seem 
to have usually a displacement distribution that is closer 
to a power law. The trajectories, displacement distri- 
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FIG. 6: (Color online) Trajectories (a, d), displacement distributions P{d) (b, e), staying time distributions P{ts) (insets in 
(b, e)) and correlation patterns (c, f) of two typical individuals (upper and lower three panels for individual No. 9 and No. 22 
respectively 



buttons and correlation patterns (d^jdj+i) of two typical 
users are shown in Fig. [6l User No. 9 has many long- 
range movements, and his/her displacement distribution 
obeys well a power law. Significant positive correlation is 
also observed. In contrast, the displacement distribution 
of user No. 22 is bimodal-like, and the correlation is also 
not obvious. 

The positive correlation reflects a gradually changing 
nature of human displacements. Previous studies in the 
temporal patterns have found that this gradually chang- 
ing process, or say the cascading effect, is much relevant 
to the emergence of burstiness in human activities [sij . 
as well as the long-term persistences [H, [s^ . Our results 
seem to indicate that the positive correlation in the dis- 
placements is related to the scaling properties in human 
mobility patterns. 

To prove this hypothesis, we need to test the relation- 
ships between the strength of the correlation and the 
form of the users' displacement distributions. 

Using the method introduced above, we first calculate 
the Kendall's Tau of the series di and di+i for each user, 
as shown in Table 1. Although all tk of the 32 users are 
positive, the value varies in a wide range from 0.2 to 0.5, 
showing a great diversity in the correlation. More than 
2/3 of all users (23/32) have the Kendah's Tau tk > 0.3 
and exhibit significant positive correlation. 



The correlation coefficients Ra of each user's displace- 
ment series are also calculated. Due to the heterogenous 
displacements, the logarithm of displacement \nd is used 
here, so Ra is defined as: 

p ((log(d,) - log(4))(log(rf.+i) - log(rf.))) 

where d* is the average displacement of the user and a 
is the variance of the displacement series d. The values 
of Ra for all 32 users are shown in Table 1. All of them 
are higher than 0.5, showing strong positive correlation 
in agreement with the above results for the method of 
Kendall's Tau. And also, to quantify the long-term cor- 
relations, we calculate the Hurst exponent H of the series 
log d of each user using DFA and find if > 0.5 for all of 
them (Table 1), showing significant long-term persistence 
on displacements. 

To check whether individual-level displacement distri- 
butions exhibit a power-law form, we plot these distri- 
butions and find that most of them seem to be power- 
law-like after log-binning. Here the Kolmogorov-Smirnov 
Test (KS Test) [s^l is used to test the power-law fits of 
these empirical data points. After estimating and setting 
a lower bound Xmm in the dataset, KS test will return 
confidence probability pi^g. Generally speaking, the big- 
ger pks is, the better the fit is. Table I shows pks of 
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the log-binning displacement distribution for each user, 
in which most of them have Pks > 0.1 and have a well- 
fitted power-law-like section. 

However, several users have very large estimated values 
for Xmin in the KS test, showing that the power-law-like 
section only covers a small range in the tail of P{d). We 
therefore fix x,nin to 10 meters to test if P{d) can be well 
fitted by a power law in all of the range. This yields the 
confidence probability p'^g. Unfortunately, in only a few 
users is the requirement p'j^g > 1 fulfilled, as shown in 
Table 1, indicating that for most of these users a power 
law is observed over only a small range. 

To quantify the differences between P{d) and strict 
power law, one can also directly fit the data points of 
P{d) to get the Pearson correlation coefficient i?^ be- 
tween the fitting curve and P(d)[3^. The better fitting 
corresponds to smaller negative values of R'^ due to the 
decaying power-law function, and Ra = — 1 for the P{d) 
that completely coincides with a power law. As shown in 
Table 1, all users' R'^ are less than —0.9. 

Now we have five quantities for each individual, tk, 
Ra and H for the correlations of user's displacements, 
PKS and R'^ for the quality of the power-law fitting. We 
plot six relationships of these quantities and respectively 
calculate their Kendall's Tau values, as shown in Fig. 
[71 Most of these correlations are significant, and in sup- 
porting of our previous guess that the scaling mobility 
patterns usually correspond to higher correlation of dis- 
placements. This result implies that the cascading-like 
processes play an important role in the emergence of the 
scaling properties in human movements. 

However, unlike the previous findings in human com- 
munications [3^ [33I I , the long-term correlations of move- 
lengths look independent of the power-law exponents of 
P{d) (The Kendall's Tau between a and H is -0.129 
with 95% significance level confidence interval 0.262). 

Similarly, we calculate the Pearson correlation coef- 
ficient i?( between the staying time distribution P(ts) 
and power law fits for each individual, as shown in Ta- 
ble 1. However, i?^ does not show significant correla- 
tions with R'^ and p'j^g (Kendall's Taus respectively are 
0.214 and -0.048 for the confidence interval 0.262), and 
weak negative correlations with Ra and tk (Kendall's 
Taus respectively are -0.266 and -0.262 with the confi- 
dence interval 0.262). Combining these results with the 
observation that the staying time does not correlate with 
the displacement at the population level, we infer that 
the effect of the dynamics on the staying time is rather 
unrelated to that on the displacement. 

At last, we compare the empirical correlation with 
the modeling results reported in Ref. [2^. This model 
can create power-law distributed displacements from a 
random-walk process on a hierarchical geographical net- 
work. This series of displacements has inherent positive 
correlation due to the cascading process on the hierarchi- 
cal organization. These modeling results and our empir- 
ical findings show high level of similarities for both the 
strength of the correlation and the pattern of its decay 
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FIG. 7: (Color online) Patterns show the correlations: (a) 
p'ks vs. TK, (b) p'ks vs. Ra, (c) p'ks vs. H, (d) R'i vs. 
TK, (e) R'i vs. Ra, (f) R'i vs. H. Kendall's Tau of these 
correlations respectively are (a) 0.367, (b) 0.246, (c) 0.278, 
(d) -0.274, (e) -0.286, (f) -0.254, with 95% significance level 
confidence interval 0.262. 



as the interval m increases, implying that the cascading 
effect is the common origin of both the scaling displace- 
ment distribution and the positive correlation. A more 
detailed discussion can be found in Appendix D. 



V. CONCLUSIONS AND DISCUSSIONS 

By analyzing the dataset of GPS carriers, we observe 
the scaling temporal-spatial properties in the aggregated 
human movements as well as individual-level diversities. 
The displacement distribution at the population level is 
well-fitted by a power law. However, the individuals' 
mobility shows much diversity: some of them display 
common scaling properties, but others are irregular, in 
agreement with several recent studies [l^ . 

Our most remarkable finding is the significance pos- 
itive correlation of the series of displacements both at 
the population level and at the individual level, show- 
ing that the gradually changing nature is wide-spread in 
human mobility. We surprisingly find that the strength 
of the correlation for each individual is significantly re- 
lated with their displacement distribution: the individu- 
als with stronger displacement correlation have a higher 
probability to possess a power-law-like displacement dis- 
tribution. This result is confirmed by four types of cor- 
relations (Fig. [7]) and implies that the cascading-like dy- 
namics is an important mechanism in the emergence of 
scaling properties of human mobility. Although the sam- 
ples in our analysis are not very big, this result is still 
highly believable, as most of the correlations/correlations 
well pass the test with 95% significant level and support 
each other. 

We notice that the displacements and staying times 
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TABLE I: Information and fitting parameters of the 32 individuals, where Us and are the number of effective staying 
positions and displacements of each user, and dmax are the average displacement and the maximum displacement of the 
user, and a is the fitting exponent of P{d) using the estimated lower bound Xmin- The definition of other parameters can be 
found in the main text. 









/in 


d /m 
'-''max / 










H 


J- 


'its 










Pks 




K 




1 


424 


407 


10980 


R7QR4.A 





.231 


Q 


085 





.360 


0. 


83 


4 37 


X 


10~ 


3 


49 9 


\ 


496 


1 21 X 10 


-17 


-0. 


978 


-0 989 


2 


278 


257 


12180 


ZOO loo 


0, 


.360 


Q 


108 





.500 





.85 


3 98 


X 


10~ 


1 


'^9n'^ ^ 
ozuo . o 


■y 


604 


1 (T^ V 1 n 


-7 


-0. 


983 


-0 980 


3 


747 


716 


14311 


OZU J- 


0, 


.312 


Q 


065 


0, 


.421 





.80 


6 42 


X 


10" 


1 


404 9 


\ 


469 


9 99 V ID" 


-26 


-0. 


989 


-0 993 




232 


220 


7301 


zo 1 yu 1 


0, 


.390 


g 


113 


0, 


.515 





.66 


6.76 


X 


10" 


1 


4477 9 


2 


084 


Q7 V in" 
o . y 1 X ±u 


-13 


-0. 


968 


-0 991 


g 


323 


310 


1373 


40117 


0, 


.327 


Q 


099 





.392 





.79 


2.41 


X 


10" 


1 


97 6 


\ 


547 


1 13 X 10 


-5 


-0. 


989 


-0 974 


g 


237 


228 


905 


21326 


0, 


.300 


Q 


111 


0, 


.325 





.55 


3.89 


X 


10" 


1 


152 5 


\ 


649 


9 Q1 V in 


-4 


-0. 


980 


-0 991 


9 


1036 


996 


3161 




0, 


.368 


Q 


056 


0, 


.504 





.84 


1.23 


X 


10" 


1 


9 9 


\ 


426 


1 21 X 10 


-1 


-0. 


994 


-0 993 


10 


563 


541 


2022 


iOoZOO 


0, 


.367 


Q 


075 


0, 


.457 





.86 


5.18 


X 


10" 


1 


98 6 


■y 


543 


•J . U 4: A J. iJ 


-7 


-0. 


992 


-0 997 


12 


299 


290 


1086 


21866 


0, 


.288 


Q 


100 





.348 





.51 


9.51 


X 


10" 


1 


97 6 


■y 


752 


7 "^9 V in" 

1 .O^ A J.U 


-IS 


-0. 


966 


-0 993 


15 


240 


218 


47632 




0, 


.187 


g 


116 


0, 


.227 





.72 


3.43 


X 


10" 


1 


351 14 


2 


065 


Q 9"? V in" 
y . zo A J.U 


-13 


-0. 


976 


-0 980 


22 


1050 


986 


3055 


60918 


0, 


.230 


Q 


055 





.344 





.69 


4.72 


X 


10" 


1 


5601 9 


3 


232 


1 10 X 10 


-24 


-0. 


964 


-0 972 


26 


2702 


2650 


370 


16074 


0, 


.376 


g 


034 


0, 


.546 





.85 


3.62 


X 


10" 


1 


15 2 


■y 


656 


n7 V in" 


-10 


-0. 


987 


-0 996 


27 


297 


287 


10732 


ooyyo4: 


0, 


.380 


g 


104 


0, 


.546 


1 


.00 


1.44 


X 


10" 


1 


35 4 


■y 


335 


1 71 X 10 


-3 


-0. 


985 


-0 986 


28 


729 


681 


5271 




0, 


.326 


g 


068 


0, 


.403 





.72 


1.74 


X 


10" 


7 


10 1 


■y 


277 


1 89 V 1 n 

J. . O Z A J. iJ 


-7 


-0. 


985 


-0 983 


29 


296 


279 


14295 




0, 


.188 


g 


103 


0, 


.217 





.70 


8.50 


X 


10" 


1 


132 7 


■y 


512 


9 Q9 V in" 
z . y z A J.U 


-17 


-0. 


984 


-0 986 


34 


243 


233 


1682 


34774 


0, 


.443 


g 


110 


0, 


.554 


0. 


.99 


8.35 X 10~ 


1 


264 9 


■y 


692 


1 V in 

O . -LU A J.U 


-5 


-0. 


975 


-0 967 


37 


243 


237 


731 




0, 


.231 


g 


110 





.307 





.78 


9.61 X 


10" 


1 


21 7 


\ 


692 


9 V in 


-5 


-0. 


982 


-0 983 


39 


731 


673 


1 138 


6481 


0, 


.153 


g 


068 


0, 


.244 


0. 


.53 


6.37 


X 


10- 


21 


10 


■y 


316 


fi n^i V in" 

D.UO A ±U 


-21 


-0. 


898 


-0 959 


40 


379 


365 


1234 


18140 





.518 


g 


093 





.655 





.88 


9.56 


X 


10" 


1 


2282 1 


2 


344 


1 "^9 V in" 

J- . OZ A J.U 


-11 


-0. 


963 


-0 996 


41 


234 


224 


1533 


33011 


0, 


.373 


0, 


.114 





.471 


0, 


.81 


1.06 


X 


10" 


1 


116.9 


1, 


.498 


4.49 X 10 


-5 


-0. 


,986 


-0.983 


42 


290 


278 


982 


37454 


0, 


.363 


0, 


.101 





.446 


0, 


.79 


7.77 


X 


10" 


1 


225.8 


1, 


.706 


2.24 X 10 


-4 


-0. 


,984 


-0.990 


43 


382 


357 


4419 


72620 


0, 


.369 


0, 


.090 





.438 


0, 


.91 


1.39 


X 


10" 


3 


10.7 


1, 


.293 


1.15 X 10 


-3 


-0. 


,987 


-0.989 


44 


361 


354 


212 


9105 


0, 


.333 


0, 


.091 


0, 


.544 


0, 


.88 


4.25 


X 


10" 


1 


20.8 


1, 


.827 


6.84 X 10 


-5 


-0. 


,979 


-0.982 


46 


215 


204 


2206 


19166 


0, 


.260 


0, 


.119 


0, 


.333 


0, 


.66 


2.58 


X 


10" 


1 


73.2 


1, 


.443 


1.44 X 10 




-0. 


,970 


-0.989 


52 


258 


248 


1497 


39341 


0, 


.242 


0, 


.106 


0, 


.308 


0, 


.54 


3.73 


X 


10" 


1 


867.4 


2, 


.023 


2.44 X 10 


-8 


-0. 


,981 


-0.992 


54 


823 


796 


8716 


1159507 


0, 


.442 


0, 


.064 


0, 


.578 


0, 


.82 


1.04 


X 


10" 


1 


9.9 


1, 


.408 


1.02 X 10 


-1 


-0. 


,992 


-0.993 


78 


319 


306 


4432 


151888 





.416 


0, 


.096 





.509 


0, 


.79 


8.21 


X 


10" 


1 


17.5 


1, 


.427 


2.63 X 10 


-2 


-0. 


,990 


-0.993 


79 


549 


530 


1038 


78978 





.444 


0, 


.076 





.568 


0, 


.81 


1.72 


X 


10" 


1 


12.7 


1, 


.490 


7.41 X 10 


-3 


-0. 


,991 


-0.985 


116 


272 


261 


567 


13700 


0, 


.390 


0, 


.105 


0, 


.412 


0, 


.82 


8.14 


X 


10" 


1 


10.9 


1, 


.525 


7.65 X 10 


-1 


-0. 


,986 


-0.986 


123 


247 


234 


6133 


255818 


0, 


.437 


0, 


.114 


0, 


.576 


0, 


.83 


9.76 


X 


10" 


1 


3999.2 


2, 


.024 


3.58 X 10" 


-13 


-0. 


,970 


-0.994 


134 


360 


340 


9839 


624928 


0, 


.348 


0, 


.094 


0, 


.416 


0, 


.86 


1.01 


X 


10" 


1 


47.5 


1, 


.339 


5.37 X 10" 


-10 


-0. 


,985 


-0.984 


137 


528 


483 


21151 


1291274 


0, 


.374 


0, 


.079 





.479 


0, 


.81 


6.66 


X 


10" 


3 


92.6 


1, 


.304 


8.13 X 10 


-7 


-0. 


,991 


-0.997 



are largely independent both at the population level and 
at the individual level, indicating that the mechanisms 
that drive the emergences of their scaling laws are also 
independent. This result is helpful for the modeling, as 
it indicates that we can divide the empirical findings into 
several classes that may have similar dynamics according 
to their correlations, and then can be modeled indepen- 
dently. 

Finally, the speed-displacement pattern shows the ab- 
normal transition from a linear to a sub-linear relation- 
ship (Fig. [3|), which may indicate the change of trans- 
portation from walks to automobile and the average 
longest walking distance in daily life. In addition, the 
impact on human mobility patterns due to traffic jams 
are observed here. 

In summary, we find that the positive correlation of 
the series of displacements that describes the cascading- 
like movements, is a characteristic factor that is relevant 
to the underlying mechanisms of the scaling of mobility 
patterns from the empirical analysis. Our findings and 
the methods used provide some useful insights for fur- 
ther empirical and modeling studies of human mobility 
patterns. 



Acknowledgments 

This work was supported by the National Natural 
Science Foundation of China Grants Nos. 11205040, 
11105024, 70971089 and 10635040. XPH acknowledges 
the Zhejiang Provincial Natural Science Foundation of 
China (Grant No. LY12A05003), the start-up founda- 
tion and Pandeng project of Hangzhou Normal Univer- 
sity. We thank Dr. M. Pleimling for his helpful discus- 
sions on this paper. 



Appendix A: Dataset descriptions and the 
judgement of effective stay positions 

The data used in this study has been provided by the 
Microsoft Geo-life project and contains over 2 years of 
GPS trajectories (from April 2007 to August 2009) of 
165 individuals. The datasets are available at the web- 
site: http:/ /research. microsoft. com/en-us/downloads 
/bl6d359d-dl64-469e-9fd4-daa38f2b2el3/. The GPS 
data was collected by different GPS handheld equip- 
ments or GPS phones. In most of them, the interval of 
recording time ranges from 2 to 5 seconds. The data 
set includes more than 10,000 trajectories, the total 
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FIG. 8: (Color online) Snapshot of the distinguishing of effec- 
tive staying positions. The figure left shows original trajectory 
of one GPS carrier. The figure right shows the effective stay- 
ing positions connected by lines in order, where each vertex 
represents an effective staying position. 



recording distance is more than 1 million kilometers, and 
the total recording time is more than 48,000 hours. The 
trajectories are widely distributed in the world, covering 
more than 30 cities in China, and several cities in North 
America, Europe, South-east Asia, etc. The movements 
recorded by the data set include not only trips to work 
or home, but also many daily-life activities, such as 
shopping, sightseeing, dining, hiking, and cycling, etc. 
The recording time for different individuals is different, 
and ranges from several weeks to several years. A 
trajectory file consists of a sequence of the records of 
trajectory points, and each record provides information 
on the latitude, longitude, and altitude of the position of 
the GPS holder, and the corresponding recording time. 

The data sets are composed by a series of geographic 
locations with corresponding time recordings ordered by 
the time sequence. They can not directly show the po- 
sitions that users really have stayed in, so first of all we 
should distinguish the effective stay positions from the 
record. We set the resolutions for distinguishing of stay- 
ing positions to 10 meters for the displacement which is 
the critical spatial resolution of a handheld GPS equip- 
ment, and 120 seconds for the time which is the interval 
of traffic signals. 

Consider a trajectory labelled by {Pi,P2,-- :Pn}, 
where a continuous sub-sequence {Pj,--- , Pk} (where 
^ < j < k < N) satisfies the following two conditions: 
the distances between two consecutive track points are 
less than 10 meters, and the total time length of the sub- 
sequence {Tj,--- ,Tk} is larger than 120 seconds. The 
average position of the sub-sequence is recorded as an ef- 
fective stay position, and ts = T^— Tj is the staying time 
of the stay position. As illustrated in Fig. 1, the average 
position SI of track points from P3 to P7 are considered 



as an effective stay point, as all the geographical distances 
from P3 to P7 are no more than 10m and T7 — T3 < 120s. 
The same holds true for S2 for the track points from PIO 
to P15. The straight-line distance between SI to S2 is 
set as the user's displacement for the movement from SI 
to S2. 

Most of the files in the data set only contain the records 
of few hours or minutes. Since the critical staying time 
in each stay position is set as 120 seconds, we usually can 
not obtain enough effective stay positions to achieve good 
patterns of user's mobility, and we therefore abandon all 
the files where the recording time is less than 6 hours and 
, we are left with 927 files from 100 users. Using the above 
algorithms, we distinguish the effective stay positions of 
each of the 100 users from the 927 files, which are used 
in our analysis at the population level. 

However, in our empirical analysis at the individual 
level, the number of effective stay positions of more than 
half of the 100 users is too small to extract its patterns. 
We thus remain with the data of 32 users with a number 
of effective stay positions that is larger than 200. Notice 
that we analyze the files of a same user one by one, and 
the statistical patterns of the user is aggregated from all 
of his/her files. 



Appendix B: Kendall's Tau 

In our empirical analysis, the displacements of the 
users are very heterogeneous, covering several orders of 
magnitude. Thus classical measurements like the Pearson 
coefficient are not suitable in analyzing the correlation of 
these displacements. We therefore use the rank-based 
correlation coefficient named Kendall's Tau. For two se- 
ries X = {xi,X2j ■ ■ ■ ,x„i} and y = {yi,?/2, • • • ,Vm}, the 
Kendall's Tau is defined as [36i] 

2 

'TK = —, -T V sgn[(a;, - Xj){y^ ~ y^)], (Bl) 

mm — 1 ^ — ' 

where sgn(x) is the signum function, which equals +1 
if a; > 0, -1 if a; < 0, and if x = 0. tk ranges from 
-1-1 (exactly the same ordering of x and y) to -1 (reverse 
ordering of x and y), and two uncorrelated series have 
Tk ~ 0. Obviously, as tk is calculated based on the 
order of the elements in two series, the magnitudes of 
differences on the value of the elements do not impact 

TK- 



Appendix C: Detrended fluctuation analysis 

The detrended fluctuation analysis (DFA) is a method 
proposed to evaluate the self-affinity of a time series in 
stochastic processes. It was first developed by Peng, et 
al. [29], and is helpful to reveal the extent of long-term 
correlations of a time series. Using the DFA method. 
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the Hurst exponent can be derived through the following 
procedures. 

i) Considering a time series {dt,t G N}, we first need 
to calculate the integration D{t) of the time series, 



-D(0 < dt >). 



(CI) 



where < • > means an average over all t's. 

ii) Then divide D{t) into mutually disjoint boxes of 
size At. 

iii) In each box, using the least square method, we can 
get a n-order polynomial fit Dfit{t), which is called the 
n-order trend. The residual series, in which the trend 
has been eliminated, can be derived by applying a sub- 
traction. 



Yit) = D(t) ^ Df,t{t). 



(C2) 



iv) Calculate the mean square error of each box over 
the size At after eliminating the trend. 



At 

E{Atr = Y.Yitf 
t=i 



(C3) 



v) Next calculate the root-mean-square deviation, or 
say fluctuation, over different (Ai)'s. 



FiAt) = \ll-^E{Atr. 



(C4) 



vi) If the time series {dt} satisfies a power-law distri- 
bution, the quality F{At) will also follow a power-law 
increasing function. 



F{At) ^ {Atf, 



(C5) 



where H is the Hurst exponent that we want to calculate. 
Here H = 0.5 represents the time series is completely 
uncorrelated, and 0.5 < H < 1.0 indicates the time series 
is of long-term correlation. 



Appendix D: Comparison with the purely 
cascading-like mobility process 

Our findings show the significant positive correlation 
among the displacements of human mobility. It is neces- 
sary to compare these empirical results with the modeling 



displacement series with inherent positive correlation and 
scaling properties. Here the hierarchical-traffic-system 
model reported in Ref. [13] is considered. 

The basic rules and results of the model can be briefly 
introduced as follows: Firstly we create a hierarchical 
geographic network on a two-dimensional plane. In the 
plane, K top-layer nodes, K{M — I) 2nd-layer nodes, 

KM'^-^iM - 1) nth-layer (n > 2) nodes, 
and KM^^'^{M — 1) A^th-layer nodes are randomly dis- 
tributed on the plane. Each node is then connected to its 
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FIG. 9: (Color online) The decay of Kendall's Tau tk between 
di and di+m as a function of the interval m for modeling series 
(a) and No. 9 user (b). 



nearest up-layer node. For the nth-layer node, its weight 
is Wn = r^~", where r > 1 presents the upper layer 
nodes that have more attraction for agents. After the 
construction of the hierarchical network, an agent ran- 
domly walks on it. The probability that the agent will 
move to a neighboring city is proportional to its weight. 
Simulations and analytical results show that the agent's 
displacement distribution obeys power law with exponent 
^ = 3-4 log^j r. When M = 9 and r = 2, 13 fa 1.7, which 
is very close to the empirical findings. 

Obviously, due to the hierarchical organization, the 
probability that walkers directly move from a top-layer 
node to a bottom layer node is small. Since the long 
range movements only appears between two higher layer 
nodes, the displacement of the agents is gradual changed 
and has inherent positive correlation. We calculate the 
Kendall's Tau of the displacement series, and its value 
is 0.39, very close to our empirical results. In addition, 
Tk of the modeling series also shows the power law de- 
cay when m increases, as shown in Fig. [Hja), which is 
comparable with Fig. [S) 
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